Efficient reinforcement learning: computational theories, neuroscience and robotics.

Authors

  • Mitsuo Kawato
  • Kazuyuki Samejima
Abstract

Reinforcement learning algorithms have provided some of the most influential computational theories of behavioral learning that depends on reward and penalty. After briefly reviewing the supporting experimental data, this paper tackles three difficult theoretical issues that remain to be explored. First, plain reinforcement learning is much too slow to be considered a plausible brain model. Second, although the temporal-difference error plays an important role both in theory and in experiments, how it is computed remains an enigma. Third, the function of all brain areas, including the cerebral cortex, cerebellum, brainstem and basal ganglia, seems to necessitate a new computational framework. Computational studies that emphasize meta-parameters, hierarchy, modularity and supervised learning to resolve these issues are reviewed here, together with the related experimental data.
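For readers unfamiliar with the term, the temporal-difference error named in the abstract is conventionally written as delta = r + gamma * V(s') - V(s). The sketch below is a minimal illustration under textbook tabular TD(0) assumptions and is not taken from the paper. The function names, the three-state chain, and the values of `alpha` and `gamma` are all hypothetical choices made for this example.

```python
def td_error(reward, value_next, value_current, gamma=0.9):
    """One-step temporal-difference error: delta = r + gamma * V(s') - V(s)."""
    return reward + gamma * value_next - value_current


def td0_update(values, state, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one tabular TD(0) update in place and return the TD error."""
    delta = td_error(reward, values[next_state], values[state], gamma)
    values[state] += alpha * delta  # move V(s) a small step toward the TD target
    return delta


# Hypothetical three-state chain: A -> B -> end, with reward 1 on leaving B.
values = {"A": 0.0, "B": 0.0, "end": 0.0}
for _ in range(200):
    td0_update(values, "B", 1.0, "end")
    td0_update(values, "A", 0.0, "B")
```

In this toy chain the estimate for `B` approaches the reward of 1 and the estimate for `A` approaches the discounted value `gamma * 1`. Even here, many repeated sweeps are needed before the estimates settle, which is loosely the kind of slowness the abstract attributes to plain reinforcement learning.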

Related resources

Computational Theories of Curiosity-Driven Learning

What are the functions of curiosity? What are the mechanisms of curiosity-driven learning? We approach these questions using concepts and tools from machine learning and developmental robotics. We argue that curiosity-driven learning enables organisms to make discoveries to solve complex problems with rare or deceptive rewards. By fostering exploration and discovery of a diversity of behavioura...

Thesis Proposal: Efficient and Tractable Methods for System Identification through Supervised Learning

Latent state dynamical systems constitute an essential modeling tool for various applications including natural language processing, econometrics, robotics and computational neuroscience. Still, many existing approaches to learning dynamical systems can suffer from high computational demand, lack of theoretical guarantees or modeling limitations such as inability to represent continuous systems...

Using BELBIC based optimal controller for omni-directional threewheel robots model identified by LOLIMOT

In this paper, an intelligent controller is applied to control omni-directional robots motion. First, the dynamics of the three wheel robots, as a nonlinear plant with considerable uncertainties, is identified using an efficient algorithm of training, named LoLiMoT. Then, an intelligent controller based on brain emotional learning algorithm is applied to the identified model. This emotional l...

Computational Models of Cognitive and Motor Control

Most of the earliest work in both experimental and theoretical/computational systems neuroscience focused on sensory systems and the peripheral (spinal) control of movement. However, over the last three decades, attention has turned increasingly towards “higher” functions related to cognition, decision-making and voluntary behavior. Experimental studies have shown that specific brain structures...

Closed Loop Interactions between Spiking Neural Network and Robotic Simulators Based on MUSIC and ROS

In order to properly assess the function and computational properties of simulated neural systems, it is necessary to account for the nature of the stimuli that drive the system. However, providing stimuli that are rich and yet both reproducible and amenable to experimental manipulations is technically challenging, and even more so if a closed-loop scenario is required. In this work, we present...

Journal:
  • Current Opinion in Neurobiology

Volume 17, Issue 2

Pages: -

Publication date: 2007